IS

Lin, Chienting

Topic Weight Topic Terms
0.247 office document documents retrieval automation word concept clustering text based automated created individual functions major
0.199 data classification statistical regression mining models neural methods using analysis techniques performance predictive networks accuracy
0.173 results study research information studies relationship size variables previous variable examining dependent increases empirical variance

Focal Researcher     Coauthors of Focal Researcher (1st degree)     Coauthors of Coauthors (2nd degree)

Note: click on a node to go to a researcher's profile page. Drag a node to reallocate. Number on the edge is the number of co-authorships.

Chen, Hsinchun 1 Nunamaker, Jr., Jay F. 1
document clustering techniques 1 experimental research 1 group support systems 1 self-organizing maps 1
unsupervised learning algorithms 1

Articles (1)

Verifying the Proximity and Size Hypothesis for Self-Organizing Maps. (Journal of Management Information Systems, 1999)
Authors: Abstract:
    The Kohonen Self-Organizing Map (SOM) is an unsupervised learning technique for summarizing high-dimensional data so that similar inputs are, in general, mapped close to one another. When applied to textual data, SOM has been shown to be able to group together related concepts in a data collection and to present major topics within the collection with larger regions. This article presents research in which the authors sought to validate these properties of SOM, called the Proximity and Size Hypotheses, through a user evaluation study. Building upon their previous research in automatic concept generation and classification, they demonstrated that the Kohonen SOM was able to perform concept clustering effectively, based on its concept precision and recall 7 scores as judged by human experts. They also demonstrated a positive relationship between the size of an SOM region and the number of documents contained in the region. They believe this research has established the Kohonen SOM algorithm as an intuitively appealing and promising neural-network-based textual classification technique for addressing part of the longstanding "information overload" problem.